Discretization of Time Series Data

نویسندگان

  • Elena S. Dimitrova
  • Paola Vera-Licona
  • John McGee
  • Reinhard C. Laubenbacher
چکیده

An increasing number of algorithms for biochemical network inference from experimental data require discrete data as input. For example, dynamic Bayesian network methods and methods that use the framework of finite dynamical systems, such as Boolean networks, all take discrete input. Experimental data, however, are typically continuous and represented by computer floating point numbers. The translation from continuous to discrete data is crucial in preserving the variable dependencies and thus has a significant impact on the performance of the network inference algorithms. We compare the performance of two such algorithms that use discrete data using several different discretization algorithms. One of the inference methods uses a dynamic Bayesian network framework, the other-a time-and state-discrete dynamical system framework. The discretization algorithms are quantile, interval discretization, and a new algorithm introduced in this article, SSD. SSD is especially designed for short time series data and is capable of determining the optimal number of discretization states. The experiments show that both inference methods perform better with SSD than with the other methods. In addition, SSD is demonstrated to preserve the dynamic features of the time series, as well as to be robust to noise in the experimental data. A C++ implementation of SSD is available from the authors at http://polymath.vbi.vt.edu/discretization .

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Time-Discretization of Time Delayed Non-Affine System via Taylor-Lie Series Using Scaling and Squaring Technique 293 Time-Discretization of Time Delayed Non-Affine System via Taylor-Lie Series Using Scaling and Squaring Technique

A new discretization method for calculating a sampled-data representation of a nonlinear continuous-time system is proposed. The proposed method is based on the wellknown Taylor series expansion and zero-order hold (ZOH) assumption. The mathematical structure of the new discretization method is analyzed. On the basis of this structure, a sampled-data representation of a nonlinear system with a ...

متن کامل

Finding Persisting States for Knowledge Discovery in Time Series

Knowledge Discovery in time series usually requires symbolic time series. Many discretization methods that convert numeric time series to symbolic time series ignore the temporal order of values. This often leads to symbols that do not correspond to states of the process generating the time series. We propose a new method for meaningful unsupervised discretization of numeric time series called ...

متن کامل

An Evaluation of Discretization Methods for Non-Supervised Analysis of Time-Series Gene Expression Data

Gene expression data has been extensively analyzed using non-supervised machine learning algorithms, with the objective of extracting potential relationships between genes. Many of these algorithms work with discretized versions of the expression data. However, the many possible methods that can be used to discretize the data have not been comprehensively studied. In this paper, we describe a n...

متن کامل

Control-Relevant Discretization of Nonlinear Systems With Time-Delay Using Taylor-Lie Series

A new time-discretization method for the development of a sampled-data representation of a nonlinear continuous-time input-driven system with time delay is proposed. It is based on the Taylor-Lie series expansion method and zero-order hold assumption. The mathematical structure of the new discretization scheme is explored and characterized as useful for establishing concrete connections between...

متن کامل

Simplicial Multivalued Maps and the Witness Complex for Dynamical Analysis of Time Series

Topology-based analysis of time-series data from dynamical systems is powerful: it potentially allows for computer-based proofs of the existence of various classes of regular and chaotic invariant sets for high-dimensional dynamics. Standard methods are based on a cubical discretization of the dynamics and use the time series to construct an outer approximation of the underlying dynamical syste...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:
  • Journal of computational biology : a journal of computational molecular cell biology

دوره 17 6  شماره 

صفحات  -

تاریخ انتشار 2010